Technical supplement to "Consistent probabilistic outputs for protein function prediction"
Abstract
Protein function prediction, in the context of the Gene Ontology (GO), is a task that consists of answering, for a fixed protein X, a large number of binary questions of the form: "Does protein X belong to GO term Y?" These binary classification problems are strongly related because the ontology consists of nested classes. Two natural requirements for this prediction problem are:
• that the set of predictions be consistent, i.e., that if a protein is assigned a GO term, then it is also assigned all the ancestor GO terms, and
• that high-confidence predictions can be produced with a quantified confidence level.
Methods of structured classification proposed in machine learning (Taskar et al. [2003]) could in theory be used to tackle this problem. However, two practical difficulties that need to be surmounted are the large amount of missing data and the large scale of the ...
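The consistency requirement stated in the abstract can be illustrated with a minimal sketch. It assumes a toy ontology given as a hypothetical `PARENTS` mapping from each term to its direct parents; the term names and all function names are invented for illustration and are not the method of the paper. The probabilistic check relies only on the fact that, if membership in a term implies membership in its parent, the probability of the term cannot exceed that of the parent.

```python
from typing import Dict, List, Set

# Hypothetical toy ontology (not real GO terms): term -> direct parents.
PARENTS: Dict[str, List[str]] = {
    "root": [],
    "A": ["root"],
    "B": ["root"],
    "C": ["A", "B"],  # a term may have several parents, as in a DAG
}

def ancestors(term: str, parents: Dict[str, List[str]]) -> Set[str]:
    """Return every ancestor of `term` (parents, grandparents, ...)."""
    seen: Set[str] = set()
    stack = list(parents[term])
    while stack:
        t = stack.pop()
        if t not in seen:
            seen.add(t)
            stack.extend(parents[t])
    return seen

def is_consistent(assigned: Set[str], parents: Dict[str, List[str]]) -> bool:
    """True if every assigned term comes with all of its ancestors."""
    return all(ancestors(t, parents) <= assigned for t in assigned)

def close_upward(assigned: Set[str], parents: Dict[str, List[str]]) -> Set[str]:
    """Add all missing ancestors so that the assignment becomes consistent."""
    closed = set(assigned)
    for t in assigned:
        closed |= ancestors(t, parents)
    return closed

def probabilities_consistent(probs: Dict[str, float],
                             parents: Dict[str, List[str]]) -> bool:
    """True if no term is more probable than any of its direct parents."""
    return all(probs[t] <= probs[p] for t in probs for p in parents[t])
```

For example, assigning only `"C"` is inconsistent under this toy ontology, and `close_upward({"C"}, PARENTS)` adds `"A"`, `"B"`, and `"root"` to repair it.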